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Abstract 

A model  is  proposed  for  concept  learning  and  subsequent 
recognition  and  classification  of  OLD  and  NEW  exemplars.  The 
model,  called  the  "property-set  model,"  assumes  that  a learned 
exemplar  is  encoded  in  memory  as  a set  of  the  component 

properties  and  combinations  of  properties  of  the  exemplar. 

Hecoqni.tion  of  a presented  exemplar  is  assumed  to  be  an 
increasing  function  of  the  memory  strengths  of  its  component 

property-sets,  while  classification  cf  the  exemplar  is  determined 
by  its  most  diagnostic  pr opert v -se t . This  model  is  contrasted 
with  a number  of  alternative  models,  including  prototype-plus- 
transformation,  feature-frequency,  and  nearest  neighbor  models. 
In  an  experimental  evaluation  of  alternative  models,  subjects 
attempted  to  learn  two  concepts  by  classifying  exemplars  in  an 
anticipation  paradigm.  They  then  performed  recognition  and 
classification  tasks  with  particular  exemplars.  Cn  a within- 
sublect  basis,  the  property-set  model  was  the  best  predictor  of 
both  recognition  and  classification  performance. 
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Recent  research  in  concept  learring  and  concept  utilization 
has  addressed  the  question  of  what  information  is  stored  in 
memory  when  only  exemplars  of  a concept,  but  not  the  concept 
itself,  are  presented  to  a subject.  Several  models  have  been 
proposed  to  account  for  the  nature  of  the  stored  information,  how 
it  is  used  by  subjects  to  classify  familiar  and  unfamiliar 
exemplars,  and  how  it  is  used  to  make  recognition  judgments  about 
familiar  and  novel  exemplars  (Posner,  1969;  Bransford  and  Franks, 
1971;  Franks  and  Bransford,  1971;  Reitman  and  Bower,  1973; 
Neumann,  1974).  In  general,  these  models  fall  intc  two  classes: 
models  based  on  STRENGTH  of  the  stored  information  and  models 
based  on  DISTANCE  of  presented  items  from  stored  information. 
Distance  models  assume  that  one  or  more  representatives  of  the 
presented  exemplars  are  stored  in  memory.  The  representatives 
may  be  either  actual  exemplars  or  some  mat  heita  t ica  1 combination 
of  the  attributes  of  the  exemplars.  Subsequently  presented 
exemplars  are  recognized,  or  classified  according  to  their 
distance  from  the  stored  representatives  (with  models  differing 
on  the  proposed  distance  metric).  Strength  models,  on  the  other 
hand,  assume  that,  more  specific  information  about  exemplais,  such 
as  their  component  features,  is  stored  in  memory  with  strength 
proportional  to  the  amount  of  experience.  Performance  on 
recognition  or  classif ication  of  a subsequently  presented 
exemplar  is  then  assumed  to  be  a function  of  the  strength  in 
memory  of  its  component  features. 

Experimental  results  have  been  reported  it.  support  of 
various  versions  of  not  h classes  of  models.  in  a typical 
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paradiqin  used  to  support  distance  models,  subjects  are  repeatedly 
presented  stimuli  such  as  dot  patterns  (Posner,  Goldsmith,  & 
Melton,  1967;  Posner  and  Keele,  1968),  geometric  forms  (Franks 
and  Bransford,  1971),  or  compound  linguistic  propositions 
(Bransford  and  Franks,  1971).  The  stimuli  consist  of  prototypic 
patterns  to  which  varying  amounts  of  noise  or  numbers  of 
transformations  have  been  applied  to  produce  deviations  from  the 
prototypes.  The  prototypes  themselves  are  not  presented  to  the 
subjects.  On  a subsequent  recognition  test,  subjects  categorize 
as  OLD  or  NEW  the  prototypes  and  distortions  of  the  prototypes 
that  have  or  have  not  been  presented  previously.  In  such 
experiments,  subjects  recognize  the  prototype  with  highest 
confidence,  even  though  it  has  never  been  presented. 
Furthermore,  in  the  latter  two  studies  subjects'  recognition 
confidence  for  test  exemiplars  declined  monotoriicall  y as  the 
number  of  transformations  from  the  prototype  to  the  test  exemplar 
i ncreased . 

Franks  and  Bransford  (1971)  explained  these  results  by 
proposing  that  during  presentation  of  the  original  set  of 
exemplars,  subgects  synthesized  a single  internal 
representation  of  each  prototype.  At  the  same  time,  subjects 
induced  and  stored  representations  of  the  transformations  that 
had  been  applied  to  the  prototypes  to  produce  the  presented 
exemplars.  Durinq  the  recognition  task,  subjects  evaluated  a 
test  item  according  to  its  transformational  distance  from  the 
most  similar  stored  prototype. 

PLanks  and  Bransford  briefly  considered  a feature- 
frequency  (strength)  model  as  an  alternative  to  the  prototype- 
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plus-transformation  (distance)  model  they  proposed.  Features 
were  defined  to  be  individual,  position-dependent,  geometric 
figures.  On  16  comparisons  between  pairs  of  test  items  with 
equal  feature  frequencies  fcut  different  transformational 
distances,  fourteen  of  the  comparisons  favored  the  prototype- 
plus-  transformat  ion  model.  In  addition,  Frants  and  Dranstord 
tested  recognition  of  several  "non-cases"--stimuli  that  could  not 
be  produced  by  the  transformation  rules,  but  which  possessed 
combinations  of  the  same  features  as  valid  exemplars.  The 
feature-frequency  model,  as  they  formulated  it,  predicted 
recognition  of  non-cases  according  to  the  presentation 
frequencies  of  their  component  features,  while  the  prototype- 
plus-  transformation  model  predicted  non-recognition.  Again, 
their  results  favored  the  prototype-plus-transformation  model. 

The  rejection  of  the  class  of  strength  models  was 
considered  premature  by  some  investigators.  Hayes-Foth  and 
Hayes-Roth  (1973)  arqued  that  a version  of  the  feature-frequency 
model  treating  single  figures  and  pairs  of  adjacent  figures  as 
features  is  consistent  with  results  of  the  16  comparisons 
employed  by  Franks  and  Pransford  to  reject  the  simple  feature- 
frequency  model.  It  should  also  be  noted  that  the  non-cases 
Franks  and  drarisfotd  tested  necessarily  involved  pairs  of 
adjacent  qeometric  figures  or  much  lower  frequency  than  the 
corresponding  pairs  in  the  valid  exemplars.  As  a result,  the  low 
recognition  ratings  qiven  to  non-cases  are  predicted  by  this 
modified  feature-frequency  model  as  well  as  by  the  prototype- 
plus-transformation  nodel.  In  addition,  Peitman  and  Bower  (1973) 
and  Neumann  (1974)  have  provided  support  for  other  feature- 
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trequency  models  in  slightly  modified  versions  of  the  Branstord 
and  Franks  (1971}  and  Franks  and  Lrarsford  (1971)  experiments. 

The  results  of  these  studies  indicate  that  it  is  possible  to 
design  experiments  whose  results  support  distance  models, 
strength  models,  or  both.  A major  difficulty  in  designing 
experiments  to  distinguish  the  two  types  of  models  arises 
from  the  substantial  agreement  in  their  prediction  of 
recognition  ratings.  In  general,  as  the  transformational 
distance  of  a test  exemplar  from  the  prototype  of  a set  of 
presented  exemplars  is  decreased,  the  presentation 
frequencies  of  its  component  features  increase.  Thus,  both 
classes  of  models  usually  predict.  decreasing  recognition 
ratings  as  transformational  distance  is  increased.  The  purpose 
of  the  present  study  was  to  test  several  alternative  distance  and 
strength  models  using  an  experimental  paradigm  that  independently 
manipulated  transformational  distance  from  a prototype  and 
frequency  of  occurrence  of  componential  features.  In  addition,  a 
particular  strength  model,  called  the  "property-set.  model,"  is 
proposed  that  accounts  for  earlier  results  and  is  comparatively 
evaluated  against  the  alternative  strength  and  distance  models. 

The  property-set  model,  referred  to  elsewhere  as  the 
"schematic  model"  (Hayes-Roth,  1974;  Hayes-Roth  and  Hayes-Roth, 
1973),  assumes  that  an  exemplar  is  encoded  in  memory  as  a set  of 
properties.  The  term  "property"  is  used  here  to  distinguish  the 
information  structures  we  propose  from  features.  "Feature" 
conventionally  refers  to  the  presence  of  a particular  attribute 
(a  unary  predicate).  We  use  the  term  property  to  include 
features  and  any  higher-order  predicates  that  can  be  asserted 
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about  the  exemplar.  The  particular  properties  encoded  for  a 
qiven  exemplar  are  assumed  to  depend  upon  the  individual's 
prior  knowledqe  and  choice  of  encodinq  strategies,  the  context 
in  which  the  exemplar  is  presented,  etc. 

The  model  assumes  that  a memory  representation  is  created 
for  each  element  of  the  powerset  of  properties  of  an  exemplar. 
That  is,  a memory  representation  is  created  for  all  individual 
properties  and  conjunctions  of  properties.  He  refer  to  each 
element  of  the  powerset  of  encoded  properties  as  a "property- 
set.”  For  example,  qiven  the  exemplar  "a  car  that  is  red," 
conventional  feature  models  would  assume  encoding  of  two 
features,  red  and  car.  The  property-set  model,  on  the  other 
hand,  assumes  encoding  of  (at  least)  three  property-sets:  red, 
car,  and  red  S car.  Each  property-set  is  associated  with  a 
strength  variable  that  is  incremented  whenever  that  subset  of 
properties  is  included  amonq  newly  encoded  exemplars.  Suppose, 
for  example,  one  were  learning  the  concept  "what  Jane  likes"  by 
means  of  three  exemplars,  two  red  cars  and  one  small  car.  The 
property-sets  encoded  from  these  exemplars  would  have  strengths 
proportional  to  their  frequency  of  occurrence  (i.e.,  red[2], 
smallfl],  carf3  1,  red  6 car[  2 1,  small  S car[  1 ])  . The  magnitude 
of  the  increment  in  property-set  strength  depends  upon  contextual 
and  subject-specific  factors.  The  model  also  assumes  that  the 
name  of  the  concept  represented  by  the  presented  exemplar  is 
associated  with  each  of  the  stored  property-sets.  So,  for 
example,  "what  Jane  likes"  would  be  associated  with  each  of  the 
property-sets  specified  above.  He  now  consider  how  these  stored 
property-sets  are  used  to  make  recognition  judgments  and 


classification  decisions  for  particular  exemplars  presented  to 
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su  bjects. 

Recognition  of  an  exemplar  is  assumed  to  be  an  increasing 
function  of  the  strengths  of  its  property-sets.  An  exemplar  X 
should  be  recognized  better  than  an  exemplar  Y only  when  each 
property-set  associated  with  X is  at  least  as  strong  as  the 
corresponding  propeLty-set  associated  with  Y,  with  strict 
inequality  in  at  least  one  case.  when  such  a relationship 
obtains  between  X and  Y,  we  will  say  that  X dominates  Y.  The 
model  predicts  equal  recognition  of  X and  Y if  corresponding 
pr ope r ty -se ts  have  equal  strengths.  By  the  term  "corresponding 
property-set"  we  mean  one  that  includes  features  from  the  same 
semantic  categories. 

For  example,  suppose  that  after  experiencing  the  exemplars 
(of  "what  Jane  likes")  red  car,  red  car,  and  small  car,  the 
learner  were  given  a recognition  test  that  included  the  test 
items  green  car,  red  car,  and  red  wagon.  These  three  test  items 
would  be  encoded  by  proper ty- se ts  with  the  following  strengths: 
green  car  (qreen[0  1,  car[3  1,  green  fi  car[  0 ])  , red  car  (red[2], 
carf  3 1,  red  6 ca  r[  2 1)  , red  wagon  (red[  2 ],  wagcn[0],  red  6 
wagonrO]).  "Red  car"  should  be  recognized  mote  confidently  than 
either  "green  car"  or  "red  wagon",  since  red[2]  is  stronger  than 
qreenfOl  and  as  strong  as  red[  2 ],  car[3]  is  as  strong  as  car[3] 
and  stronqer  than  wagon[0"),  and  red  G car[2]  is  stronger  than 
green  G carf  0 1 and  red  S waqon[0].  The  model  makes  no 
assumptions  for  how  to  combine  property-set  strengths.  Hence,  it 
is  unable  to  predict  whether  green  car  or  ted  wagon  should  be 
recoqnized  more  confidently  (qreen[0]  is  weaker  than  red[2], 
carf  31  is  stronger  than  wagonfO],  while  green  G car[0  ] and  red  G 
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wagonfO}  are  equal  in  strenqth). 

In  this  example,  exemplars  of  only  one  class  {"what  Jane 
likes”)  were  learned.  As  a result,  recognition  judgments  were 
based  on  the  strengths  of  property-sets  associated  only  with  that 
class.  What  happens  when  exemplars  of  two  or  more  classes  have 
been  learned?  There  are  two  possibilities:  recognition  judgments 
could  be  based  on  the  strengths  of  property-sets  associated  with 
all  learned  classes  or  only  with  the  class  to  which  the  test  item 
belongs.  Both  variants  are  considered  here  and  are  referred  to 
as  the  "simple  property-set  model"  and  the  "within-class 
property-set  model.” 


Classification  of  an  exemplar  requires  the  learner  to  decide 
in  which  of  a set  of  classes  the  exemplar  belongs.  This  process 
is  assumed  by  the  model  to  be  determined  by  the  most 
diagnostic  property-set  for  each  exemplar.  The 
d iaqnostici ty  of  a property-set  for  any  class  is  an  increasing 
function  of  its  associative  strength  to  that  class  and  a 
decreasing  function  of  its  associative  strength  to  other  classes. 
Hence  the  most  diagnostic  property-sets  are  those  that  occur  only 
within  exemplars  of  a single  class,  while  the  least  diagnostic 
property-sets  occur  equally  often  among  exemplars  of  all  classes. 
The  model  assumes  that  the  strength  of  association  of  ptoperty- 
sets  to  alternative  classes  is  determined  by  a classical 
likelihood  estimate. 

As  an  illustration,  suppose  that  a learner  were  presented 
the  exemplars  red  car,  red  car,  and  small  car  for  "what  Jane 
likes,"  and  the  exemplars  red  bike  and  blue  wagon  for  "what  Sue 
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likes."  How  would  the  learner  subsequently  answer  the  question: 
"Who  likes  a red  waqon?"  Accordinq  to  the  present,  model,  red 
waqon  would  be  encoded  as  property-sets  with  the  following 
strengths  of  association  to  each  of  the  two  classes:  (1)  "what 
Jane  likes":  red[2  1,  wagonf  0 ],  red  6 wagonf  0 ];  (2)  "what  Sue 
likes":  redf  11,  wagonf  1],  red  6 wagon[  0 ].  The  property-set.  red  C 
waqon  has  no  diagnostic  power  at  all,  because  it  has  zero 
strength  of  association  to  both  classes.  The  property-set  red 
has  strenqth  2 for  "what  Jane  likes"  and  1 for  "what  Sue  likes." 
The  property-set  waqon  has  strength  association  0 for  "what 
Jane  likes"  and  1 for  "what  Sue  likes."  According  to  the 
classical  likelihood  estimate,  wagon  has  greater  relative 
strenqth  of  association  (-1/1)  to  "what  Sue  likes"  than  "red"  has 
to  "what  Jane  likes"  (=2/3).  Thus,  it  is  more  diagnostic  and 
should  determine  the  classification  response.  That  is,  the 
learner  should  judge  that  it  is  Sue  who  likes  a red  wagon. 

The  present  experiment  was  designed  to  distinguish  the 
property-set  models  of  recognition  and  classification 
performance  from  the  prototype-plus-transformation  model. 
Subjects  learned  class  concepts  during  a training  session  in 
which  exemplars  were  presented  for  classification  and  feedback 
was  provided.  Then  subjects  provided  recognition  confidence 
judgments  for  OLD  and  NEW  exemplars  whose  feature  frequencies  and 
transformational  distances  from  the  class  prototypes  were 
orthogonally  manipulated.  This  manipulation  guaranteed 
differential  predictions  by  the  prototype-plus-transformation 
model  and  the  property-set  model  for  subjects*  recognition 
performance.  In  particular,  the  prototype-plus-transformation 


model  predicts  recognition  confidence  to  be  solely  a function  of 
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transf ormational  distance,  while  the  property-set  irodel  predicts 
recoqnition  confidence  to  depend  only  on  prior  property- 
set  frequencies.  Finally,  subjects  were  presented  exemplars  for 
classification  into  the  learned  classes.  Again,  the  prototype- 
plus-transformation  model  predicts  performance  on  the  basis  of 
distance  from  the  nearest  prototype,  while  the  property-set  model 
predicts  classif ica tion  to  be  an  increasing  function  of  the 
classical  likelihood  estimator  of  the  most  diagnostic  property- 
set. 

In  addition  to  these  two  models,  24  other  plausible  strength 
and  distance  models  for  recognitior  and  10  other  models  for 
classification  were  comparatively  evaluated.  The  various 
recognition  models  are  presented  and  described  briefly  in 
Appendix  I.  Recognition  models  R1  and  R2  are  the  simple 
property-set  model  and  within-class  property-set  model  described 
above.  Model  R3  is  the  prototype-plus-transformation  distance 
model.  Models  R4-R10  are  alternative  distance  models  that  make 
varying  assumptions  about  what  stored  information  the  test 
exemplar  is  compared  to.  For  models  Ri-RIO  the  distance  between 
two  exemplars  is  defined  to  be  the  number  of  features  that  are 
different  between  the  two  items.  Distance  models  R11-R18  are 
identical  to  models  R3-R10  in  their  assumptions  about  what 
information  is  compared  to  the  test  exemplar,  but  these  models 
compute  distance  by  a different  metric.  Models  R19-R26  are 
alternative  stronqth  models  that  make  varying  assumptions  about 
what  features  or  combinations  of  features  are  encoded  from 
exemplars  and  which  of  these  features  are  criterial  in  making 
recognition  judgments.  R19-R24  are  different  versions  of  simple 
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feature  models,  while  P25-R26  are  property-set  models  that  assume 
only  the  most  Irequent  property  set  is  criterial  in  making 
recognition  judqments.  In  Appendix  I the  predictions  for  each  of 
the  models  tor  subjects'  recognition  performance  are  outlined. 

For  the  classification  task,  12  models  were  tested  in  all. 
These  models  are  presented  in  Appendix  II.  Model  Cl  is  the 
property-set  model  described  above  that  utilizes  the  classical 
likelihood  estimate  of  the  most  diagnostic  prope r ty -set . Model 
C2  is  the  standard  protot ype - plus -t r ansf crma tion  model.  Models 
C3-C4  are  alternative  distance  models  that,  as  in  the  recognition 
models  above,  make  varying  assumptions  about  what  test  items  are 
compared  to  in  memory.  Models  C10-C12  are  alternative  strength 
models  that  vary  in  theii  assumptions  about  what  features  are 
stored  and  how  presentation  frequencies  influence  classification 
decisions. 

Method 

SMijgSts*  One- hundred-e ight  high  school  students  served  as 
subjects.  Individual  honoraria  of  $5.GC  were  contributed 
to  a student  fund.  In  addition,  $5.00  bonuses  were  paid  to  1U 
students  for  qood  performance. 

•Materials.  The  experimental  materials  consisted  of  132 
exemplars  of  one  of  three  concepts  subjects  were  to  learn  and  be 
tested  on  during  the  experiment.  fach  exenplar  was  a description 
of  a fictitious  individual,  including  the  individual's  surname, 
aqe,  education,  marital  status,  and  hotfcy.  The  surname  and  hobby 
were  distractor  features  only;  the  values  of  the  other  features 
were  manipulated  as  independent  variables.  Each  of  these 
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features  could  have  one  of  four  values  as  follows:  age:  30,  40, 
50,  60;  education:  junior  high,  high  school,  trade  school, 
colleqe;  marital  status:  single,  married,  divorced,  widowed.  The 
four  values  of  each  significant  feature  may  be  represented 
symbolically  by  the  numbers  1-4;  hence,  stimulus  111  might  be  the 
exemplar  "John  Doe,  30  years  old,  junior  high  education,  single, 
plays  chess."  The  assignment  of  the  symbolic  values  1-4  to  the 
values  of  each  significant  feature  was  randomized  for  each 
subject  in  the  experiment.  Similarly,  132  arbitrarily  selected 
surnames  and  three  hobbies  (chess,  sports,  stamps)  were  randomly 
assigned  to  the  exemplars  for  each  subject. 

The  three  concepts  by  which  the  exemplars  could  be 
classified  were  membership  in  Club  1,  membership  in  Club  2,  or 
membership  in  neither  club.  Stimuli  111,  222,  333,  and  444  were 
defined  to  be  the  prototype  of  Club  1,  the  prototype  of  Club  2, 
neutral  feature  values,  and  neither-club  feature  values, 
respectively.  The  two  prototypes  111  and  222  represented 
archetypical  members  of  Clubs  1 and  2.  The  rules  governing  club 
membership  were  as  follows:  If  the  number  of  1's  (2’s) 
exceeds  the  number  of  2's  (1's)  in  an  exemplar,  and  there  are 
no  4 ' s,  the  exemplar  is  in  Club  1 (2);  if  the  number  of  1's 
equals  the  number  of  2's  and  there  are  no  4*s,  the  exemplar 


can  be 

i n 

either  club. 

each 

with 

probability 

.5;  if  a 

present. 

the 

individual  is 

in  nei 

ther 

cl  u b. 

(The 

presence 

one  or 

more 

3's  had  no 

imp  1 ica 

tion ; 

i.e.  , 

those 

feature  va 

were  irrelevant  to  Club  1-Club  2 discrimination.) 


The  exemplars  tor  the  experiment  were  transtorniations  of  the 
two  club  prototypes.  These  were  generated  by  replacing  one  or 
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two  of  the  pcototypic  features  (for  example,  the  1 ' s)  with  some 
other  features.  These  replacement  features  were  either  from  the 
other  club  prototype  (2's),  from  the  set  of  irrelevant  features 
(3's),  or  from  the  set  of  features  precluding  membership  in 
either  club  (4  ’ s)  . 

Design.  Each  subject  was  presented  132  exemplars  to 
classify  durinq  training,  and  was  subsequently  presented  a set 
consistinq  of  OLD  and  NEW  exemplars  for  recognition  and 
classification  judgments.  The  exemplars  were  carefully 
constructed  to  vary  prope l ty- se t frequencies  and  transformational 
formational  distance  from  Club  1 or  Club  2 prototypes.  This  was 
accomplished  by  controlling  the  sources  of  the  replacement 
features  and  the  frequency  with  which  replacement  features 
appeared  in  the  classification  exemplars  used  to  teach 
subjects  the  concepts.  For  example,  in  Table  1,  which 
schematizes  the  desiqn  of  the  experiment,  it  may  be  noted  that 
exemplars  112  and  113  are  both  one  transformation  away  from  the 
Club  1 prototype.  However,  the  two  exemplars  have  different 
presentation  frequencies  and,  across  all  training  exemplars, 
their  component  property-sets  have  different  frequencies. 

Each  exemplar  was  either  one  or  two  substitution 
t ra  nsf  orma  ti  oris  removed  from  the  appropriate  (nearer)  Club  1 or 
Club  2 prototype  and  was  initially  classified  by  subjects 
zero,  one  or  ten  times,  as  indicated  in  Table  1.  Exemplars  that 
were  classified  more  than  once  appeared  in  the  context  of 
different  extraneous  features  (name  and  hobby)  on  each 
trial. 

The  recoqnition  and  final  classification  exemplars 
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TABLf  1 

Initial  Class  if  icat  inn  Exemplars  and  Test  Items 


Ex  emp  la  r 

Club 

Number  ot 
Initial 

Classifications 

Tested  for 
Recognition  and 
Final  C]  assif  icatiori 

112 

1 

10 

Yes 

12  1 

1 

10 

Yes 

21  1 

1 

10 

Yes 

1 1 3 

1 

1 

Yes 

131 

1 

1 

Y es 

31  1 

1 

1 

Yes 

133 

1 

1 

Yes 

313 

1 

1 

Y es 

331 

1 

1 

Yes 

221 

2 

10 

Yes 

212 

2 

10 

Yes 

122 

2 

10 

Yes 

223 

2 

1 

Y es 

232 

2 

1 

Yes 

322 

2 

1 

Yes 

233 

2 

1 

Yes 

323 

2 

1 

Yes 

332 

2 

1 

Yes 

132 

Ei ther 

10 

Yes 

321 

Either 

10 

Yes 

213 

Ei  ther 

10 

Yes 

231 


Either 


0 


Yes 
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123 

Either 

0 

Yes 

312 

Ei ther 

0 

Yes 

1 1 1 

1 

0 

Yes 

222 

2 

0 

Yes 

333 

Either 

0 

Yes 

444 

Neither 

0 

Yes 

411 

Neither 

1 

No 

422 

Neither 

1 

No 

141 

Neither 

1 

NO 

242 

Neither 

1 

No 

114 

Neither 

1 

No 

224 

Neit  her 

1 

No 

44  1 

Neither 

1 

No 

44  2 

Nei the  r 

T 

No 

144 

Neither 

1 

No 

244 

Neit  her 

1 

No 

414 

Neither 

1 

No 

424 

Nei the  r 

1 

No 

134 

Neither 

1 

No 

2 34 

Neither 

1 

No 

41  3 

Neither 

1 

No 

423 

Neither 

1 

No 

34  1 

N either 

1 

No 

342 

Neit  her 

1 

No 

124 

Neither 

1 

No 

214 

Neither 

1 

No 

412 

Neither 

1 

No 

421 

Neit  her 

1 

No 

24  1 

Neither 

1 

No 

16- 


142 

Neither 

1 

No 

143 

Neit  her 

1 

Nc 

243 

. Neither 

1 

No 

314 

Neither 

1 

No 

324 

Neither 

1 

NO 

431 

Neither 

1 

No 

432 

Neither 

1 

Nc 
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(also  indicated  in  Table  1)  included  the  18  initially  classified 
unambiguous  Club  1 and  Club  2 exemplars,  the  three  initially 
classified  exemplars  of  either  club,  three  previously 
unclassified  exemplars  of  either  club,  and  the  four  previously 
unclassified  prototypes.  Each  exemplar  was  one,  two,  or  three 
transformations  from  the  nearer  Club  1 or  Club  2 prototype  and 
had  been  initially  classified  zero,  one,  or  ten  times. 

Subjects  were  divided  randomly  into  two  groups  with  54  in 
each  qroup.  Group  1 had  no  prior  knowledge  of  the  nature  of  the 
prototypes.  Group  2 was  given  cards  describing  Club  1,  Club  2, 
and  neither-club  prototypes.  They  were  told  that  the  more  Club 
1 (2)  characteristics  individuals  had,  the  more  likely  they 
were  to  be  in  Club  1 (2) , but  that  the  presence  of  neither-club 
characteristics  indicated  membership  in  neither  club. 

Procedure.  Subjects  were  tested  in  groups  of  8-10.  They 
were  instructed  that  their  task  was  to  learn  to  classify 
individuals  into  each  of  three  groups:  Club  1 members.  Club  2 
members,  and  members  of  neither  club.  Subjects'  performance  was 
self-paced.  On  an  initial  classification  task,  subjects  worked 
through  132  pairs  of  classif ica t ion/teedback  cards,  attempting  to 
classify  each  individual  and  to  learn  from  the  feedback.  The 
exemplars  included  valid  members  of  club  1,  Club  2,  either 
club,  and  neither  club.  Fxemplars  valid  for  either  club  were 
followed  by  Club  1 and  Club  2 feedback  cards  equally  often,  but 
either  response  was  counted  as  correct  in  the  data 
analysis.  The  exemplars  were  punched  and  interpreted  on  blank 
computer  cards.  Each  card  listed  an  individual's  surname,  age, 
education,  marital  status,  and  hobby.  At  the  end  of  the  card. 
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there  was  a place  for  the  subject  to  circle  a 1,  2,  or  N, 
indicating  his  judgment  of  that  individual's  club  membership 
(Club  1,  2,  or  neither).  Each  classification  card  was  followed 
by  a feedback  card  that  was  identical  to  it,  except  that  it 
indicated  the  correct  club.  Following  initial  classification, 
subjects  were  given  a recognition  task  in  which  they  classified 
descriptions  of  individuals  as  OLD  or  NEW,  indicating  a 
confidence  level  between  1 (least)  and  5 (most).  Subjects  were 
provided  with  computer  cards  that  listed  "ANYONE"  instead  of  a 
specific  name,  followed  by  an  age,  education,  and  marital  status. 


At  the  end 

of 

the  card,  there  was 

a place  tor  the 

subject  to 

circle  a 

Y 

or  N,  indicating 

whether  he  did 

or 

did  not 

recognize 

that  conf iguration  of 

characteristics. 

and 

a place 

tor  the  subject  to  circle  an  L (least),  2,  3,  4,  or  M (most), 
indicating  his  confidence  in  his  recognition  judgment. 
Subseguently , subjects  were  given  a final  classif ication  task  in 
which  they  classified  descriptions  of  individuals  as  being  in 
Club  1 or  2,  again  indicating  confidence.  Responses  were  again 
written  on  computer  cards  listing  "ANYONE,"  followed  by  an  age, 
education,  marital  status,  and  location  for  the  subject  to  circle 
a 1 or  2 and  provide  a confidence  judgment.  At  the  end  of  the 
experiment  subjects  wrote  descriptions  of  the  characteristics  of 
typical  members  of  each  of  the  two  clubs. 

Results 

Subjects'  performance  on  both  initial  and  final 
classification  tasks  was  influenced  by  whether  or  not  they  knew 
the  prototypes  in  advance.  On  initial  classification,  the 
proportion  of  correct  responses  for  subjects  who  had  knowledge  of 
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the  protot 

y pes 

(.8 

2) 

was  reliably  higher  than 

for 

su 

bjects  who 

had  n 

o knowledge 

of  the  prototypes  (.62), 

t ( 10 

5)  * 

10.99,  p < 

.001 . 

Thi 

s re 

lat  i 

on 

ship  also  held  for  performance 

on 

the  final 

ciassi 

f ica 

tion 

ta 

sk 

(.87  vs.  .74),  t ( 105 

) = 

7.1 1 

, £ < . 001 . 

Howe ve 

this 

man 

ip 

ulation  did  not  produce  d 

if  fer 

ert  i 

al  results 

in  e 

va  lu 

at  in 

g 

th 

e fits  of  the  Lecognit 

ion 

and  final 

ciassi 

f ica 

tion 

da  t 

a 

to  the  various  strength  a 

nd  di 

stan 

ce  models. 

Theret 

ore. 

th 

e d 

at 

a have  been  combined  for 

these 

t wo 

coridit  ions 

in  the 

pre 

sent 

atio 

n 

of  results. 

T 

he  m 

ean 

reco 

gn 

ition  and  classification 

judgments 

for  each 

test 

exem 

plar 

ty 

pe 

are  given  in  Table  2.  T 

he  va 

lues 

listed  are 

Z-tran 

sf  or 

med 

sub  j 

ec 

t responses  averaged  ac 

ross 

all 

sub  jects. 

These 

rati 

nqs 

were 

u 

sed  to  evaluate  comparati 

vely 

t he 

predict  ions 

of  the 

var 

ious 

mod 

el 

s for  subject  performance 

Ta 

ble 

3 presents 

a com 

pari 

son 

of 

t 

he  simple  property-set 

mode 

i ( 

R 1 ) and  the 

protot 

ype- 

plus 

tra 

ns 

formation  model  ( R 3 ) on 

pairs 

of 

exemplars 

for  which  the  two  models  make  differential  predictions  for 
recoqnition  judqmentn.  Pairs  were  chosen  such  that  each  exemplar 
in  a pair  differed  from  the  prototype  on  the  same  feature  (e.g.  , 
211  and  311  both  differ  from  the  prototype  in  the  "age"  feature). 
The  paired  comparisons  are  of  tour  types:  both  exemplars  either 
one  or  two  transformations  away  from  the  nearer  club  prototype, 
tor  which  the  prototype-plus-transformation  model  predicts  equal 
recoqnition  performance;  one  exemplar  one  transformation  away 
from  the  prototype  compared  to  the  prototype,  for  which  the 
prototype  model  predicts  better  recognition  for  the  prototype; 
and  paired  exemplars  either  one  or  two  transformations  away  from 
a prototype,  for  which  the  prototype  model  predicts  better 


j 

1 
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TABLE  2 

Mean  Z-t.ransf  or  ired  Recognition  and  Pinal  Classification 
Ratings  for  Individual  Exemplars 


Test  Exemplar 
(Age,  Education, 
Marital  Status) 

fiecogni t ion 
a 

Rating 

Classif icat ion 
b 

Rating 

1 12 

3.27 

-2.43 

121 

1.  85 

-2.  4ft 

211 

3. 09 

-2.4ft 

1 1 3 

-0.0ft 

-2.57 

131 

. 88 

-2.  44 

311 

0.  1 8 

-2.44 

133 

- 3.  45 

-2.09 

.113 

-2.  29 

-2.  09 

331 

-2.  10 

-2.22 

221 

1.  73 

2.12 

2 12 

1. 07 

2.  3? 

122 

2.  1 7 

2.22 

223 

-0.  9 1 

2.08 

212 

0.  01 

1.97 

322 

0.  10 

2.1  1 

213 

- 1. 69 

1.94 

3 21 

-2.  22 

1.78 

132 

-1.  74 

1 .95 

112 

1.  13 

0.  00 

121 

1.58 

0.02 

213 

1.  30 

-0.09 

211 

-0.  6 1 

0.01 

123 

-1.23 

-0.  09 

312 

-0.  95 

0.10 

1 1 1 

0.  49 

-2.8? 

222 

1.50 

2.  19 

333 

-4.  19 

1.78 

4 4 4 

-0.  92 

1.12 

Original  scale:  -5  = NE*--Most  Confident,  ...» 

♦5  = OLD--Most  Confident 

b 

Original  scale:  -5  = Club  1 — dost  Confident,  ...» 
♦5  = Club  2--Most  Confident 
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TABLE  .1 


Critical  Comparisons  Between  Property-Set  and  Prototy pe-Plus- 
Transf ormation  Models  of  Recognition  of  Test  Exemplars 


Predictions 


Pai red 
Comparison 


Exemplars 


Proper  t y- Set 
Model 


Pro  tot  y pe-P  las 
Transformation 


Property-Set 
Pr ed  ict ion 
Confirmed  (+)  or 
D isconf i rmed  (-) 


Both  items 

211:311 

> 

♦ 

1 transfor- 

121:131 

> 

- 

♦ 

mation  from 

112:113 

> 

♦ 

nearer  club 

122: 32? 

> 

♦ 

protot  ype 

212:232 

> 

♦ 

221:223 

> 

4- 

1 transfor- 

211:111 

> 

< 

4- 

mation  from 

121:111 

> 

< 

♦ 

prototype 

112:111 

> 

< 

•f 

compa  red 

122:222 

> 

< 

♦ 

to  prototype 

212:222 

> 

< 

- 

22  1 :222 

> 

< 

4- 

Both  items 

321:331 

> 

4- 

2 transfor- 

132:133 

> 

= 

4- 

mations  from 

213: 3 1 3 

> 

— 

4- 

nearer  club 

132:  332 

> 

= 

•f 

prototype 

213:233 

> 

- 

4- 

321  : 32  3 

> 

r 

4- 

2 transfor- 

321:311 

> 

< 

4* 

mations  from 

321 : 322 

> 

< 

4- 

prototype 

132:131 

> 

< 

4- 

compared  to 

132:232 

> 

< 

f 

1 transfor- 

213:113 

> 

< 

+ 

mation  from 
proto  type 

213:223 

> 

< 
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recoqnition  for  the  one-transformation  exemplar.  Due  to  the 
manipulation  of  the  property-set  frequencies  of  the  paired 
exemplars,  differential  predictions  are  made  in  all  cases  by 
the  property-set  model,  as  is  noted  in  Table  3.  On  23  of  the  24 
paired  comparisons,  the  predictions  of  the  property-set  model  are 
confirmed  and  the  predictions  of  the  prototy pe- p lus- 
transf ormation  model  are  disconfirmed  (p  < .01). 

Evaluating  the  goodness  of  fit  of  each  model  to  the  data 
depended  on  generating  unique  predictions  for  ratinq  comparisons 
on  pairs  of  particular  exemplars  tor  each  model,  as  shown  in 
Table  3 for  the  property-set  and  prototype-plus-transformation 
models.  However,  each  model  provided  a different  number  of 
pairwise  comparisons  among  items,  and  the  compared  items  varied 
for  each  model.  In  addition,  a model  might  predict  either 
equality  or  inequality  for  subjects'  performance  on  a pair. 
Therefore,  an  analytic  method  was  devised  for  assigning  a single 
overall  qoodness  of  fit  value  to  each  model  that  was  unbiased 
with  respect  to  the  type  and  number  of  predictions  made  by  a 
model.  The  fit  value  was  a number  between  0 and  1 that 
corresponded  to  the  proportion  of  pairwise  predictions 
disconfirmed  by  the  data.  Error  tolerance  regions  were  defined 
such  that  the  probability  of  disconf irming  an  equality  prediction 
by  chance  was  equal  to  the  probability  of  discontinuing  an 
inequality  prediction  by  chance. 


The  recognition  data  were  analyzed  by  computing  for  a given 
model  the  lit  values  tor  each  of  the  10R  subjects.  The  mean 
within-sub ject  fit  value  tor  each  of  the  26  recognition  models 
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TABLE  4 

Proportion  of  Pairwise  Recognition  Predictions 
Disconf i rmed  for  Each  Recognition  Model 


Mean  Within-  Standard  Fit  to  Mean  Test 

Model  Sub-ject  Fit  Error  Exemplar  Ratings 


Simple  Property-set 


R 1 

.31 

.0  12 

. 06 

Within-Class  Property 
R2 

-set 

. 30 

.011 

.06 

Prototype-plus-transfoLmation 
R 3 

.4  1 

. 009 

. 27 

Alternative  Distance 
R4 

Models 

. 44 

. 007 

. 45 

R5 

. 3 o 

. 0 10 

. 15 

R6 

.57 

. 010 

.49 

E7 

. 38 

. 0 10 

.09 

R8 

.38 

.010 

. 07 

R9 

.38 

.010 

.09 

RIO 

. 49 

. 007 

. 45 

R 1 7 

.43 

.011 

. 29 

R 12 

. 45 

. 008 

.46 

R 13 

. 38 

. 0 10 

. 19 

R 14 

.56 

.012 

.51 

8 15 

. 40 

.016 

. 11 

R 16 

. 38 

. 007 

. 10 

R 17 

.41 

.013 

. 13 

R 18 

.47 

. 007 

.41 

Alternative  Strength 
R 19 

Models 

.42 

. 007 

. 24 

R 20 

.41 

. 007 

. 25 

R 2 1 

.38 

.010 

.08 

R 22 

.41 

.011 

.09 

R 2 3 

. 50 

.015 

. 44 

R 24 

.47 

.012 

.47 

R 25 

. 44 

.013 

. 40 

R 26 

.48 

. 008 

. 40 
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that  the  simple  property-set  model  ( R 1 ) and  the  within-class 
property-set  model  (E2)  provide  a better  fit  to  the  data  than  all 
other  models,  including  the  protot y pe- p lus- t r ansf orma t ion  model 
(R3).  The  last  column  of  Table  4 provides  the  fit  of  the  models 
to  the  mean  recognition  ratings  across  subjects  (the  data  given 
in  the  first  column  of  Table  2).  Again,  the  simple  property-set 
model  and  the  within-class  property-set  model  provide  the  best 
tit  to  the  data. 

To  evaluate  the  statistical  reliability  of  these  results, 
the  fit  of  each  model  was  directly  compared  to  the  fit  of  each 
other  model  by  a matched- pai rs  t-test.  The  matched  pairs  used  in 
the  analysis  were  the  108  fits  of  the  two  models  to  be  compared 
to  the  individual  subject  recognition  ratings.  This  analysis  was 
repeated  for  all  pairs  of  26  models  in  order  to  identify  the 
best-fitting  model.  All  reported  significance  levels  were  p>  < 
.05  or  smaller. 

For  both  subjects  who  knew  the  prototypes  in  advance  and 
those  who  did  not,  the  simple  property-set  model  and  the  within- 
class  property-set  model  tit  the  data  reliably  better  than  each 
of  the  other  24  models.  The  fits  of  the  two  models  were  not 
significantly  different.  Furthermore,  by  performing  a flonte 
Carlo  analysis  it  was  found  that  both  these  models  fit  the  data 
reliably  better  than  chance  (p>  < .01).  The  flonte  Carlo  analysis 
compared  the  fit  of  the  model  to  observed  data  with  the  fits  when 
mean  recognition  ratings  were  randomly  assigned  to  items.  The 
attained  significance  level  of  the  model  (i.e.  , < .01)  is  the 
proportion  of  times  Landomized  ratings  provided  a better  tit  than 
did  the  actual  data. 


The  results  of  the  final  classification  task  are  given  in 
Table  5.  The  classif ication  data  were  subjected  to  the  same 
w ithiri-sub  ject  analysis  as  described  above  for  the  recognition 


data.  The 

first 

column 

of 

Table  5 gives  the  mean 

fit  for 

10R 

sub  jects 

o f 

each 

of  t he 

12 

classif ication 

models 

listed 

in 

Appendix 

1 1. 

The 

last  column 

in  Table  5 gives 

the  fit 

of  each 

of 

the  models  to  the  mean  classification  ratings  tot  exemplars 
across  all  subjects  (the  data  in  the  last  column  of  Table  2). 
While  several  of  the  models  appeal  to  fit  the  data  veil,  the  best 
fitting  model  was  the  classical  most  diagnostic  property-set 
model  (Cl).  The  similarity  in  the  fit  values  of  several  of  the 
classification  models  is  due  to  the  fact  that  cn  many  paired 
comparisons  used  in  the  analysis,  all  models  make  the  same 
predictions.  However,  on  those  comparisons  for  which  the  models 
make  differential  predictions,  the  classical  most  diagnostic 
property-set  model  provides  the  most  reliable  predictions. 
Matched-pairs  t-tests  were  performed  on  all  pairs  of  the  12 
classification  models  by  the  same  procedure  used  for  the 
recognition  data.  The  classical  most  diagnostic  property-set 
model  fit  the  data  significantly  tetter  than  each  of  the  other  11 
models,  as  well  as  fitting  the  data  reliably  better  than 
chance  (p  < .01)  by  a Monte  Carlo  analysis. 

Discussion 

The  results  suggest  that  initially  classified  exemplars 
can  be  powerful  determinants  of  subseguent  recognition  and 
classification  behavior.  This  appears  to  be  the  case  even 
when  subjects  know  a simple  c lassif icat ion  rule  in  advance  and 
so  need  not  pay  particular  attention  to  individual  exemplars 
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TABLE  5 

Proportion  of  Pairwise  classification  Predictions 
Disconfirmed  for  Each  Classification  Model 


(lean  within-  Standard  Fit  to  Mean  Test 

Model  Subject  Fit  Error  Exemplar  Ratings 
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durinq  initial  classification.  Further,  it  appears  that  the 
presentation  frequency  of  the  property-sets  cf  initially 
classified  exemplars  influence  subsequent  recognition  and 
classification  performance.  The  property-s<  model  described 
here  formalizes  these  factors.  Subjects  appear  to  encode  from 
presented  exemplars  tingle  features  and  conjunctions  of  those 
features  in  their  memory  representation  of  the  exemplars.  The 
assumption  that  all  combinations  of  features  are  part  of  the 
encoded  representation  distinguishes  this  model  from  other 
traditional  feature-frequency  models. 

For  example,  suppose  the  Clut  1 prototype  was  30  years  old, 
junior  high  school  education,  and  single;  the  Club  2 prototype 
was  50  years  old,  college  education,  and  married;  and  40  years 
old  was  acceptable  for  either  club.  At  the  end  of  the  initial 
classification  session,  a subject  would  have  seen  the  exemplar 
(1)  50  years  old,  junior  high  school  education,  single  10  times 
(associated  with  different  names  and  hobbies).  The  exemplar  (2) 
40  years  old,  junior  high  school  education,  single  would  have 
been  seen  once.  The  different  presentation  frequencies  would  be 
reflected  in  memory  as  differential  memory  strengths  for  the 
componential  property-sets.  Combining  these  differences  with  the 
effects  of  presentation  of  other  exemplars  the  subject  would  have 
seen  sharinq  some  of  these  property-sets,  the  memory  strength 
differences  relevant  to  exemplars  (1)  and  (2),  after  the  initial 
classification  session,  would  be;  50  years  old[50]  vs.  40  years 
oldf  20  1,  50  years  old  & junior  high  educat ionf 3 1 ] vs.  40  years 
old  5 junior  hiqh  educationf  3 1,  50  years  old  8 sinqle[21]  vs.  40 
years  old  f>  sinqlef  13],  and  50  years  old  & junior  high  education 
6 sinqlef  10  1 vs.  40  years  old  f.  junior  high  education  & 


sinqlefl].  All  other  corresponding  property-sets  tor  (1)  and  (2) 
had  identical  memory  strengths. 

During  recognition,  a subject's  confidence  in  having  seen  an 
exemplar  previously  is  a function  of  the  prior  presentation 
frequencies  of  the  component  property-sets  of  the  exemplar  either 
in  al^  presented  examplars  (simple  property-set  model)  or  in 
exemplars  of  the  same  class  as  the  test  exemplar  ( w i t h in-c lass 
property-set  model).  Since  the  strengths  associated  with  (1) 
dominate  those  of  (2),  both  models  correctly  predicted  better 
recognition  of  (1)  than  (2).  The  data  obtained  in  the  experiment 
are  not  sufficient  to  discriminate  between  these  two  models. 

However,  both  of  these  models  predict  the  data  more  reliably  than 
other  simple  feature-frequency  models,  the  prot oty pe- p 1 us- 
transf ormation  model,  or  other  strength  models. 

One  might  be  tempted,  based  on  this  example,  to  postulate 
recognition  based  only  on  frequency  of  presentation  of  the  entire 
exemplar;  that  is,  that  (1)  was  recognized  better  than  (2)  simply 
because  it  was  presented  ten  times  vs.  one  time.  In  fact,  this 
model  was  evaluated  as  one  of  the  alternative  strength  models 
( R 1 9 ) and  can  be  rejected  because  it  produced  a worse  fit  to  the 
data  than  the  property-set  models. 

The  property-set  model  also  provided  the  best  accounting  of 
final  classification  performance.  This  representation  was 
combined  with  a classification  rule  that  assumes  the  most 
diagnostic  property-set  of  the  test  exemplar  (as  determined  by  a 
classical  likelihood  estimate)  determines  how  it  is  classified. 

Suppose,  for  example,  the  subject  attempted  to  classify  the  NEW 

L.  . ...  _ ii 
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exerrplar  UO  yoars  old,  junior  high  school  education,  single 
(312).  The  most  diagnostic  property-set  for  Club  1 membership 
(that  is,  the  property-set  with  the  largest  ratio  of  strength 
contr i but  ions  from  club  1 exemplars  to  total  strength)  is  junior 
high  school  education  (=26/50).  The  most  diagnostic  property-set 
for  Club  2 membership  is  married  (=28/50).  Since  these  two 
likelihood  estimates  are  identical,  the  subject  should  classify 
the  exemplar  in  Club  1 or  Club  2 with  equal  probability,  a 
prediction  confirmed  by  the  data.  This  classification  model  was 
superior  to  the  distance  and  other  strength  models  in  predicting 
subjects'  performance.  Thus,  it  seems  reasonable  to  conclude  that 
the  property-set  model  provides  the  best  theoretical  explanation 
currently  available  for  both  recognition  and  classification 
performance. 

On  the  other  hand,  one  might  conceive  of  situations  in  which 
recognition  and  classification  performance  are  influenced  by 
raciables  in  addition  to  property-set  frequency.  Such  additional 
factors  determininq  subjects'  performance  might  include  conscious 
strategy  shifts  that  result  in  subjects'  attending  to  some  subset 
of  the  presented  features  of  the  stimuli,  complexity  and 
fuzziness  of  the  concepts  to  be  learned,  the  number  and  ratio  of 
relevant  and  irrelevant  dimensions  on  which  exemplars  vary,  total 
number  of  exemplars,  amount  of  related  prior  learning, 
performance  criteria  and  feedback,  and  separability  of  property 
dimensions.  Any  theory  that  proposes  to  give  a thorough  account 
of  concept  learning  must  address  the  effects  of  all  of  these  (and 
perhaps  other)  variables.  While  the  property-set  model  is 
amenable  to  elaboration  in  order  to  account  for  the  effects  of 
all  of  these  variables,  it  does  not  do  so  in  the  present 
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formulation. 

Finally,  it  should  be  pointed  out  that  vhile  the  property- 
sot  model  provides  the  best  fit  for  the  recognition  and 
classification  data,  many  of  the  other  models  tested  also  predict 
the  data  well  above  the  chance  level.  This  result  was  obtained 
despite  the  fact  that  feature  frequency  and  distance  from  the 
prototype  were  manipulated  in  a mariner  designed  to  produce 
differential  predictions  of  strength  and  distance  nodels.  This 
illustrates  the  fact  that  in  this  and  other  similar  concept 
learning  experiments,  the  predictions  of  any  of  a number  of 
models  are  likely  to  be  identical  for  a large  set  of  the 
experimental  stimuli.  In  the  present  study,  while  many  of  the 
models  fit  the  mean  data  quite  well  (viz..  Tables  4 and  5), 
pairwise  comparisons  of  models  on  critical  stimuli  for  which  the 
models  made  differential  predictions  resulted  in  dramatic 
differences  in  the  proportion  of  confirmations  for  the  models  (as 
in  Table  3) . 

It  may  be  concluded  Irom  these  observations  that  studies  of 
concept.  learning  and  recognition  in  which  only  one  theory  is 
considered,  supported  by  confirmatory  evidence#  are 
methodologically  suspect.  Several  previous  studies  have  produced 
data  confirming  a particular  theory,  but  no  argument  against 
using  the  same  data  to  support  a number  of  equally  plausible 
alternative  theories.  In  the  present  experiment  mean  recognition 
and  classification  performance  were  predicted  at  statistically 
significant  levels  by  several  models.  Yet  it  was  also  possible 
to  reject  most  of  those  alternatives  by  means  of  the  within- 
subject  pairwise  comparisons  of  models.  While  it  is,  of  course. 
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impossible  to  compare  all  possible  alternative  theories  in 
studies  of  concept  learning  or  recognition,  it  is  obviously 
undesirable  to  propose  a theory  supported  by  data  that  can  be 
taken  in  support  of  a number  of  alternatives  as  veil.  Bather,  an 
attempt  was  made  here  to  enumerate  a set  of  well-known  and 
reasonable  alternative  recognition  and  classification  theories, 
desiqn  a testbed  in  which  differential  predictions  of  the 
theories  could  be  evaluated,  and  provide  an  analytic  procedure  to 
provide  as  much  discrimination  among  the  alternatives  as 
possible. 
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Appendix  I 

Strenqth  and  Distance  Models  for  Recognition  Performance 

In  specifvinq  the  models,  three  conventions  have  been 
adopted.  First,  since  all  test  exemplars  comprised  only  the 
three  criterial  teatures  (age,  education,  and  marital  status), 
memory  representations  and  distance  and  strength  metrics  were 
based  only  on  those  features.  Names  and  hobbies  were  assigned 
randomly  to  the  originally  classified  exemplars,  so  they 
should  not  influence  any  of  the  models*  predictions. 
Second,  all  models  assume  that  each  memory 

representation  is  associated  with  an  appropriate  class 
designator.  Third,  we  distinguished  type  and  token  exemplars. 
An  exemplar  type  is  any  set  of  the  three  criterial  features 
that  occurred  in  at  least  one  initially  classified  exemplar. 
Kach  occurrence  of  ar  exemplar  type  is  an  exemplar  token.  The 
following  are  brief  descriptions  of  alternative  models. 

(R1)  Simple  property-set  t peguenc ies.  Every  property-set  of 
the  initially  classified  exemplars  is  stored  in  memory.  Each  new 
occurrence  of  a prcpeity-set  increments  its  strength  value. 
Recognition  confidence  for  a test  exemplar  should  be  an 
increasing  function  cf  the  frequencies  of  all  property-sets  in 
memory  that  are  contained  in  the  test  exemplar.  This  model  makes 
predictions  only  when  one  test  exemplar  dominates  another  in  all 
corresponding  property-set  treguencies. 

f R 2 ) Within  -class  proper ty-sct  frequencies.  Every 


property-set  of  the  initially  classified  exemplars  is  stored  in 
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memory.  Each  property-set  has  a tag  tor  each  ot  the  concept 
classes  represented  by  initially  classified  exemplars.  Each  new 
occurrence  of  a ptoperty-set  increments  its  strength  value  for 
only  the  concept  class  represented  by  its  occurrence. 
Recognition  confidence  should  be  an  increasing  function  of  the 
frequencies  of  the  property- se ts  in  the  same  concept  class  as  the 
test  exemplar.  This  model  makes  predictions  only  when  one  test 
exemplar  dominates  another  on  all  corresponding  property-set 
f requencies. 

For  models  R3  - RIO  the  distance  between  two  exemplars  is 
defined  as  the  number  of  dimensions  on  which  they  differ  (the 
Hamminq  distance) . 

(R3)  Prototype- plus- transformat  ions.  Only  prototypes  tor 
each  of  the  three  concepts  are  abstracted  and  stored  in  memory. 
Recognition  confidence  should  be  a decreasing  function  of  the 
distance  from  the  test  exemplar  to  the  nearer  of  the  two  club 
protot  ypes. 

ALTERNATIVE  DISTANCE  MODELS 

(R4)  Minimum  total  distance  to  exemplar  types  of  either 
c lass.  Each  initially  classified  Club  1 or  2 exemplar  type 
is  stored  in  memory.  Recognition  confidence  should  be  a 
decreasing  function  of  the  minimum  sum  of  the  distances  from 
the  test  exemplar  to  the  initially  classified  exemplar  types  of 
a sinqle  class. 

( R5 ) Minimum  to ta 1 distance  to  exemplar  tokens  of  either 
c lass . Each  initially  classified  Club  1 or  2 exemplar  token 
is  stored  in  memory.  Recognition  confidence  should  be  a 
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decreasinq  function  of  the  minimum  sum  of  the  distances  from 
the  test  exemplar  to  the  initially  classified  exemplar  tokens 
of  a single  class. 

(R6)  Total  distance  to  exemplar  types  of  both  classes.  Each 
initially  classified  Club  1 or  2 exemplar  type  is  stored  in 
memory.  Recognition  confidence  should  be  a decreasing  function 
of  the  sum  of  the  distances  from  the  test  exemplar  to  the 
initially  classified  exemplar  types  of  both  classes. 

(R7)  Total  d i stance  to  e xernp  la  r tokens  of  both  cl  asses . 
each  initially  classified  Club  1 or  2 exemplar  token  is  stored 
in  memory.  Recognition  confidence  should  be  a decreasing 
function  of  the  sum  of  the  distances  from  the  test  exemplar 
to  the  initially  classified  exemplar  tokens  of  both  classes. 

(R8)  Total  distance  to  all  presentation  types.  Every 
initially  classified  exemplar  type  is  stored  in  memory. 
Recognition  confidence  should  be  a decreasing  function  of  the  sum 
of  the  distances  from  the  test  exemplar  to  the  initially 
classified  exemplar  types. 

(F9)  Total  d is t a nee  to  all  presentation  tokens.  Every 
initially  classified  exemplar  token  is  stored  in  memory. 
Recognition  confidence  should  be  a decreasing  function  of 
the  sum  of  the  distances  from  the  test  exemplar  to  the 
initially  classified  exemplar  tokens. 

( H 1 0)  Nearest  neighbor.  Each  initially  classified  Club  1 or 
2 exemplar  type  (or  token)  is  stored  in  memory.  Recognition 
confidence  should  be  a decreasing  function  of  the  minimum 
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distance  from  the  test  exemplar  to  any  stored  exemplar. 

(R11-R18)  Models  R11-R18  ace  the  same  as  models 

R3-R10  except  that  the  distance  between  two  items  is  defined 

as  a three-tuple  (d  , d , d ).  d equals  zero  (one)  if  the 

12  3 i 

two  items  are  identical  (different)  on  dimension  i (i  = 1, 

2,  3).  These  models  predict  that  an  exemplar  X should  be 
recoqnized  better  than  an  exemplar  Y if  the  distances  asso- 
ciated with  x and  y,  (x  , x , x ) and  (y  , y , y ) , satisfy 

12  3 12  3 

the  conditions:  y > x or  y = x (i  = 1#  2,  3)  and  y > x 

i i i i i i 

for  at  least  one  i.  If  y = x (i  = 1 , 2,  3)  , the  models 

i i 

predict  equal  recoqnition  of  X and  Y. 

ALTERNATIVE  STRENGTH  MODELS 

( R 1 9)  Frequency  of  presentation . Each  initially  classified 
exemplar  token  is  stored  in  memory.  Recognition  confidence 
should  be  an  increasing  function  of  the  presentation  frequency 
of  the  test  exemplar  type. 

(B20)  With  in-class  frequency  of  presentation.  Each 
initially  classified  exemplar  token  is  stored  in  memory. 
Recognition  confidence  should  be  an  increasing  function  of  the 
maximum  frequency  of  the  test  exemplar  type  in  the  appropriate 
c lass. 

(R21)  Feature  frequencies.  Each  criterial  feature  in  the 
initially  classified  exemplars  is  stored  in  memory. 
Recoqnition  confidence  should  be  an  increasing  function  of  the 
frequencies  of  the  features  in  the  test  exemplar  among  all 
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presented  exemplars.  This  model  makes  predictions  only  when 
one  test  exemplar  dominates  another  on  all  corresponding 
feature  frequencies. 

( h 2 2 ) w i t h in-c_lass  feature  f requencies.  Each  crit.erial 
feature  in  the  initially  classified  exemplars  is  stored 
in  memory.  Recognition  confidence  should  be  an  increasing 
function  of  the  frequencies  of  the  features  in  the  test 
exemplar  among  presented  exemplars  of  the  same  class  as  the 
test  exemplar.  This  model  makes  predictions  only  when  one 
test  exemplar  dominates  another  on  all  corresponding  feature 
frequencies. 

(R23)  Sum  of  fea tu re  frequencies.  Each  criteria!  feature  in 
the  initially  classified  exemplars  is  stored  in  memory. 


Recognition 

confidence  should 
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ar  increasing 
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the  sum  of 

the  frequencies  of 

the 

features  in  the 

test 

exemplar 

among  all  presented  exemplars. 

(P24)  Within-cjass  sum  of  feature  frequencies.  Each 
criterial  feature  in  the  initially  classified  exemplars  is 
stored  in  memory.  Recognition  confidence  should  be  an 
increasing  function  of  the  sum  ot  the  frequencies  of  the  features 
in  the  test  exemplar  amonq  presented  exemplars  of  the  same  class 
as  the  test  exemplar. 

(R25)  lost  frequent  prgper  t_y-se  t . Every  property-set 
encoded  for  the  initially  classified  exemplars  is  stored  in 
memory.  Recoqniticr  confidence  should  fce  an  increasing  function 
of  the  frequency  of  the  most  frequent  property-set  encoded  tor 
the  test  exemplar  amonq  all  presented  exemplars. 
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(R26)  Within-class  most  frequent  property -set.  Every 
property-set  encoded  for  the  initially  classified  exemplars 
is  stored  in  memory.  Recognition  confidence  should  he  an 


increasing 

function  of 
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test 
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Appendix  II 

Strenqth  and  Distance  Models  for  Classification  Performance 

Twelve  classification  models  were  evaluated.  For  models 
C2-C8,  the  distance  between  two  exemplars  was  defined  as  the 
number  of  dimensions  on  which  they  differed.  It  was  impossible 
to  apply  three-tuple  distances  (as  in  recognition  models  R9-R16), 
because  they  do  not  provide  a basis  for  determining  relative 
distances  to  the  exemplars  or  prototypes  of  alternative  classes. 

(Cl)  Classical  most  diagnostic  property-set . All  property- 
sets  encoded  for  the  initially  classified  exemplars  are  stored 
in  memory.  A test  exemplar  should  be  classified  in  the  class 
associated  with  its  most  diagnostic  property-set  (classical 
estimator  = (frequency  of  the  property-set  in  class 
i ) / ( f requency  of  the  property-set  in  both  classes)). 
Confidence  should  be  an  increasirq  turction  of  the 
classical  likelihood  estimator  of  the  most  diagnostic  property- 
set. 


ALTERNATIVE  distance  models 
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(C 3 ) Difference  between  distances  to  both  £rot_otypes . only 


the  two  class  prototypes  are  stored  in  meirory.  A test  exemplar 
should  be  classified  iri  the  class  associated  with  the 
nearer  prototype.  confidence  should  be  an  increasing  function 
of  the  absolute  value  of  the  difference  between  the  distances 
from  the  exemplar  to  each  prototype. 

(C4 } Minimum  sum  of  distances  to  e xemp I ar  t_y£es . Each 
initially  classified  exemplar  type  is  stored  in  memory.  A test 
exemplar  should  be  classified  in  the  class  associated  with 
the  minimum  sum  of  distances  from  the  exemplar  to  the 
class's  exemplar  types.  Confidence  should  be  a decreasing 
function  of  the  sum  of  distances. 

{ C 5 ) Difference  between  sums  of  distances  to  exemplar  t ypes 
gl  two  classes.  Each  initially  classified  exemplar  type  is 
stored  in  memory.  A test  exemplar  should  be  classified  in  the 
class  associated  with  the  minimum  sum  of  distances  from  the 
exemplar  to  the  class's  exemplar  types.  confidence  should  be 
an  increasing  function  of  the  absolute  value  of  the  difference 
between  the  sums  of  distances  from  the  test  exemplar  to  the 
exemplar  types  of  the  two  classes. 

(C6)  Minimum  sum  of  distances  to  e xemplar  tokens.  Each 
initially  classified  exemplar  token  is  stored  in  memory.  A 
test  exemplar  should  be  classified  in  the  class  associated  with 
the  minimum  sum  of  distances  from  the  exemplar  to  the 
class's  exemplar  tokens.  Confidence  should  be  a decreasing 
function  of  the  sum  of  distances. 

( C 7 ) Difference  between  sums  of.  distances  to  exemplar  tokens 
of.  two  classes.  Each  initially  classified  exemplar  token  is 
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stored  in  memory.  A test  exemplar  should  be  classified  in  the 
class  associated  with  the  minimum  sum  of  distances  from  the 
exemplar  to  the  class's  exemplar  tokens.  Confidence  should  be 
an  increasing  function  of  the  absolute  value  of  the  difference 
between  the  sums  of  distances  from  the  test  exemplar  to  the 
exemplar  tokens  of  the  two  classes. 

(C8)  Nearest  neighbor.  Each  initially  classified  exemplar 
type  is  stored  in  memory.  A test  exemplar  should  be 
classified  in  the  class  associated  with  its  nearest  neighbor 
(minimum  distance)  in  memory.  Confidence  should  be  a 
decreasing  function  of  the  distance  from  the  exemplar  to  its 
nearest  neighbor. 

ALTEOA1IVE  STRENGTH  MODELS 

( C9 ) Sum  of  between -class  proper ty-set  frequencies.  All 
property-sets  encoded  for  the  initially  classified  exemplars 
are  stored  in  memory.  A test  exemplar  should  be  classified  in 
class  i if  all  cf  its  property-sets  have  occurred  more 
frequently  amonq  the  exemplars  of  class  i than  class  j. 
Confidence  should  be  an  increasing  function  of  the  sum  of  the 
differences  between  frequencies  of  association  between 
corresponding  property-sets  and  the  two  classes. 

(CIO)  Bayesian  most  diagnostic  property- set.  All  property- 
sets  encoded  tor  the  initially  classified  exemplars  are  stored 
in  memory.  A test  exemplar  should  be  classified  in  the  class 
associated  with  its  most  diagnostic  property-set  (Bayesian 
estimator  = (frequency  of  the  property-set  in  class  i ♦ 
1 )/ (frequency  of  the  property-set  in  both  classes  ♦ 2)). 
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Confidence  should  te  an  increasing  function  of  the  Bayesian 
likelihood  estimator  ot  the  most  diagnostic  property-set. 

(C 1 1 ) Frequency  of  presentation  in  either  class.  Initially 
classified  exemplar  tokens  are  stored  in  memory.  A test 
exemplar  should  be  classified  in  the  class  most  frequently 
associated  with  its  type  in  memory.  Confidence  should  be  an 
increasing  function  of  the  frequency  of  presentation  of  the 
exemplar  type  in  the  class. 

(Cl  2)  Dif  ference  between  presentation  frequencies  in  the  two 
classes.  Initially  classified  exemplar  tokens  are  stored  in 
memory.  A test  exemplar  should  be  classified  in  the  club 
most  frequently  associated  with  its  type  in  memory. 
Confidence  should  te  an  increasing  function  of  the  absolute 
value  of  the  difference  between  its  presentation  frequencies  in 
the  two  clubs. 
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. We  thank  L.  T.  Frase,  S.  K.  liced,  J.  S.  Keitman  and  F.  B. 
Hunt  roc  helpful  criticism  ot  earlier  versions  of  this 
paner.  Perry  Thorndyke  deserves  special  thanks  for  gen- 
erously  sharing  his  talents  with  us  in  writing  the  nnai 
version  of  the  paper.  The  research  reported  here  was 
conducted  while  Barbara  Hayes-Hot  ii  was  employed  at  Bell 
Laboratories  and  Frederick  Hayes-Roth  was  employed  at 
Car  ncqie-  Me  1 1 on  Uni  vers i t.  y . 

Requests  tor  reprints  should  be  sent  to:  E.  Hayes-Roth, 
The  Hand  Corporation,  1700  Main  Street,  Santa  Monica, 
California,  9040b. 

3.  The  powerset  of  a set  is  the  sc-t  of  all  of  its  subsets. 
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